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5 FIELD OF THE INVENTION 

The present invention generally relates to methods for the production of 
proteins in host cells. Specifically, the present invention relates to the use of a 
chaperonin binding domain in expression systems designed for the production of 
proteins in host cells. 

10 

BACKGROUND OF THE INVENTION 

One of the simplest and most inexpensive ways to obtain large quantities of 
desired polypeptides for commercial or research uses is through the expression of 
heterologous genes in bacterial cells. Often however, net accumulation of the 

15 recombinant polypeptide is low due to degradation, missfoldings or aggregation. 
Also, many recombinant polypeptides fail to attain their correct three-dimensional 
conformation in E.coli and are found sequestered within large refractile aggregates, 
i.e., inclusion bodies. Processes for recovering active polypeptides from inclusion 
bodies can be complex and expensive. Additionally, in some cases, the large scale 

20 production of a protein is limited by the toxicity of the overexpressed protein toward 
the host cell or the accumulation of proteins as inclusion bodies that impede their 
recovery and purification. 

In the cell, a class of accessory proteins known as molecular chaperones 
function by interacting with nascent polypeptide chains and aid in the process of 

25 correct folding Georgiou et al. (1996, Current Opinion in Biotechnology, 7:190-197). 
Molecular chaperones are highly conserved proteins found in all organisms that 
control and sometimes catalyse the ATP-dependent folding of newly synthesized 
proteins and polypeptides as they are produced in cells. Chaperones mediate the 
stabilization and refolding of proteins under conditions of stress and are believed to 

30 fold crucial portions of proteins, such as enzymes, independently (Hendrick, J. P., 
1993, Ann. Rev. Biochem . 62: 349-384). 

Several E.coli proteins have been shown to exhibit chaperone activity: the 
60kDa heat shock protein (Hsp60) GroEL, a chaperonin and the smaller accessory 
protein GroES (10kDa); the DnaK (Hsp70), DnaJ and GrpE complex; and the Clp 

35 system. Georgiou et al. supra. GroEL consists of 14 subunits which are arranged in 
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two heptameric rings stacked back to back. The central cavity of the cylinder 
accepts unfolded substrate polypeptides in the conformation of a collapsed 
intermediate. GroEL interacts with GroES, a single heptameric ring that binds 
asymetrically to GroEL, capping one opening of the cylinder. GroES coordinates the 
5 ATP hydrolysis by GroEL with productive folding (Mayhew, M et al., Nature vol. 
379:420-426.) 

Dale, G. et al. (1994, Protein Engineering 7:925-931) report that 
simultaneous overproduction of the GroEL/GroES chaperonins with dihydrofolate 
reductase results in an increased solubility of the enzyme and Amrein, K. et al. 

10 (1995, Proc. Natl. Acad. Sci. vol 92: 1048-1052) report on the purification of 
recombinant human protein-tyrosine kinase in an E.coli expression system 
overproducing the bacterial chaperonins GroES and GroEL. 

Landry, S. et al. (1993, Nature 364:255-258) disclose a polypeptide loop of 
the GroES/GroEL complex and Altamirano et al. (1997, Proc. Natl. Acad. Sci. USA, 

15 94:3576-3578) disclose the use of immobilized fragments of the GroEL chaperonin in 
chromatography. 

In spite of advances in understanding chaperonins and the production of 
proteins in host cells, there remains a need to develop expression vectors and 
systems which allow for production of proteins in host cells. 

20 

SUMMARY OF THE INVENTION 

The present invention generally relates to chaperonin protein binding 
domains and to the use of an isolated chaperonin protein binding domain in the 
production of heterologous proteins, peptides or polypeptides in a host cell. The 

25 present invention is based, in part, upon the finding that a toxic gene product could 
be recombinantly produced by a host cell when expressed as a fusion protein 
associated with an isolated chaperonin binding domain. 

Accordingly, the present invention provides a method for producing a protein 
in a host cell, comprising the step of culturing a host cell comprising a first nucleic 

30 acid encoding an isolated chaperonin binding domain associated with a second 
nucleic acid encoding the protein and a third nucleic acid encoding a chaperonin 
under conditions suitable for expression of said first, said second and said third 
nucleic acid and wherein said chaperonin binding domain is capable of binding to 
said chaperonin. In a further embodiment, the chaperonin binding domain and the 
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chaperonin are capable of binding with an affinity of between about 10' 2 and 10" 8 Kd. 
The method may further comprise recovering said protein from said cell. In one 
aspect, the protein is one toxic to the host cell. A protein may be toxic to a host cell 
due to its intrinsic nature or toxic due to the presence of elevated levels in the host 
5 cell. 

In another embodiment of the present invention, the first and second nucleic 
acid encode a fusion protein. The first and second nucleic acid may be directly 
linked or indirectly linked by nucleic acid encoding an enzymatic cleavage site, a 
chemical cleavage site, or another protein or peptide. 

10 In one aspect of the invention, nucleic acid encoding the chaperonin is 

naturally produced by the host cell and the cell is grown under conditions that result 
in elevated levels of the chaperonin. In another aspect, nucleic acid encoding the 
chaperonin is heterologous to the host cell and the heterologous chaperonin is under 
the control of at least one expression signal capable of overexpressing the 

15 chaperonin in the host cell. The present invention encompasses any host cell that is 
capable of expression of recombinant proteins. In one embodiment, the host cell is a 
bacterium. In another embodiment, the host cell is a eubacterium. In yet further 
embodiments, the host cell is a gram-positive or a gram-negative bacterium. In a 
further embodiment, the bacterial cell is a member of the family Enterobacteriaceae.. 

20 In an additional embodiment, the bacterial cell is an Escherichia species, in particular 
E. coli. 

There are several well characterized chaperonin systems known in the art 
having two or more interacting partners, for example, Hsp60 and Hsp10 
(GroEL/GroES); Hsp70 and Hsp40 and GrpE (DnaK/DNAJ/GrpE); ClipA/X and 

25 ClipP; Hsp90 and Hsp70 and other factors; TriC and other factors. The present 

invention encompasses chaperonin binding domains obtainable from these systems 
as long as the chaperonin binding domain is capable of binding to a chaperonin with 
an affinity of between about 10" 2 and 10" 8 Kd. In one embodiment, the chaperonin 
binding domain has the sequence as shown in SEQ ID NO: 1 through SEQ ID NO: 

30 38. In yet another embodiment, the chaperonin binding domain is obtainable from 
the GroES co-chaperonin and said chaperonin is the GroEL chaperonin. In another 
embodiment, the binding domain comprises the amino acid sequence 
E VETKSAGG IVLTGS AAA. In a further embodiment, the binding domain comprises 
a variation of the sequence E VETKSAGG I VLTGSAAA, said variant being capable of 
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binding to GroEL chaperonin with an affinity of 10" 2 to 10" 8 Kd. The present invention 
also provides expression vectors and host cells comprising a chaperonin protein 
binding domain. 

Examples of heterologous proteins include therapeutically significant 
5 proteins, such as growth factors, cytokines, ligands, receptors and inhibitors, as well 
as vaccines and antibodies; enzymes such as hydrolases including proteases, 
carbohydrases, and lipases; isomerases such as racemases, epimerases, 
tautomerases, or mutases; transferases, kinases and phophatases; and 
commercially important industrial proteins or polypeptides, such as proteases, 
10 carbohydrases such as amylases and glucoamylases, cellulases, oxidases and 
lipases. The nucleic acid encoding the heterologous protein may be naturally 
occurring, a variation of a naturally occurring protein or synthetic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

15 Figure 1 shows the nucleic acid (SEQ ID NO: 1) and amino acid (SEQ ID 

NO:2) for the region of the chaperonin containing the GroEL/GroES binding domain. 
Figure 2 shows the growth of JM105 in the presence of increasing amount of 

ethanol: diamonds, no ethanol; squares, 1%; triangles, 2%; X, 3%; *, 4% ethanol in 
LB media. 

20 Figure 3 shows the growth rate of BAX transformants with different leader 

sequences in the presence and absence of 2% ethanol. 

Figure 4 shows and analysis of proteins present upon induction of the host 
cells with IPTG and growth in the presence of ethanol. Lanes 1 & 5 - MW markers; 
lanes 2-4 - pATP01 1 (chaperonin binding domain); 6-8 - pWS213.(OmpA leader). 

25 Bar between lanes 3&4 and 7&8 indicates position of BAX protein on the gel. 

Figure 5 shows the design of a linker for attaching the GroEL-binding loop of 
GroES to proteins. Oligonucleotides matching the two sequences shown above 
were synthesized chemically, annealed to generate the duplex DNA fragment, and 
cloned into appropriate vectors. Linkage to a gene via the EcoRI overhang 

30 generates protein 20 amino acids (1905 Daltons) longer. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

Definitions 
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The in vivo cellular processes of protein folding and assembly are controlled 
by molecular mechanisms associated with molecular chaperones such as 
chaperonins. As used herein, the term "chaperonin" refers to those molecules 
including heat shock proteins Hsp60 and like proteins that are expressed in any 

5 organism which are associated with protein folding. The present invention 

encompasses any chaperonin from any microbial source, virus or bacteriophage 
including the chaperonin systems, Hsp60 and Hsp10 (GroEL/GroES); Hsp70 and 
Hsp40 and GrpE (DnaK/DNAJ/GrpE); ClipA/X and ClipP; Hsp90 and Hsp70; and 
TriC. In a preferred embodiment, the chaperonin binding domain and the chaperonin 

10 are obtainable from the heat shock protein 60 (HSP60) class of proteins. Other 
chaperonins include mammalian or yeast HSP68, HSP70, HSP72, HSP73, clathrin 
uncoating ATPase, IpG heavy chain binding protein (BiP), glucose-regulated 
proteins 75, 78, and 80 (GRP75, GRP78, and GRP80), HSC70, and yeast KAR2, 
BiP, SSA1-4, SSB1, SSD1 and the like. Chaperone proteins which can increase 

15 protein secretion also include enzymes which catalyze covalent modification of 
proteins, such as mammalian or yeast protein disulfide isomerase (PDI), prolyl-4- 
hydroxylase B-subunit, ER p59, glycosylation site binding protein (GSBR) and thyroid 
hormone binding protein (T3BP). 

Chaperonins are known to be associated with protein folding and anti- 

20 aggregation activities (Craig, et al., 1994, Cell vol. 78, 365-372; Hendrick, et al., 

1993, Annu. Rev. Biochem. Vol. 62, 349-84; Hartl, 1994, TIBS vol. 19:20-25). Often, 
multiple sub-units are associated with one chaperonin complex. The present 
invention encompasses each chaperonin sub-unit used individually or in combination 
with other subunits providing that the individual subunit or combination of subunits is 

25 able to function by binding to a chaperonin protein binding domain. In the present 
invention, preferred chaperonins are those present in members of the family 
Enterobacteriaceae and in particular from Eschericia species. In the present 
invention, a preferred chaperonin is the GroEL chaperonin which is associated with 
the co-chaperonin GroES. A chaperonin of the present invention may be naturally 

30 occurring in the host cell or heterologous to the host cell and may be introduced by 
recombinant means. 

As used herein, the phrase "isolated binding domain" of a chaperonin or 
"chaperonin protein binding domain" or "chaperonin binding domain" refers to a 
region of a protein or polypeptide that is able to bind with an affinity of between 10" 2 
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and 10" 8 Kd to a chaperonin or portion or fragment thereof of said chaperonin. In 
one embodiment of the present invention, the chaperonin binding domain has the 
sequence as shown in any of SEQ ID NO: 1 through SEQ ID NO:38. In another 
embodiment of the present invention, the chaperonin binding domain is obtainable 
5 from the GroES co-chaperonin. As used herein, the chaperonin protein binding 
domain obtainable from GroES refers to the residues shown in Figure 1 comprising 
the sequence EVETKSAGGIVLTGSAAA. In another embodiment, the binding 
domain comprises amino acid variations of EVETKSAGGIVLTGSAAA capable of 
binding to a GroEL chaperonin with an affinity of between 10~ 2 and 10" 8 Kd. A 

10 chaperonin protein binding domain is associated with a second nucleic acid encoding 
a heterologous protein when the first and second nucleic acids are directed linked, 
such as in a fusion protein, or are indirectly linked such as having an enzymatic 
cleavage site, chemical cleavage site or other nucleic acid inserted between the first 
and the second nucleic acid. 

15 As used herein, "nucleic acid" refers to a nucleotide or polynucleotide 

sequence, and fragments or portions thereof, and to DNA or RNA of genomic or 
synthetic origin which may be double-stranded or single-stranded, whether 
representing the sense or antisense strand. As used herein "amino acid" refers to 
peptide or protein sequences or portions thereof. 

20 The terms "isolated" or "purified" as used herein refer to a nucleic acid or 

amino acid that is removed from at least one component with which it is naturally 
associated. 

As used herein, the term "heterologous protein" refers to a protein or 
polypeptide that is encoded by nucleic acid introduced into a host cell. Examples of 

25 heterologous proteins include enzymes such as hydrolases including proteases, 
carbohyd rases, and lipases; isomerases such as racemases, epimerases, 
tautomerases, or mutases; transferases, kinases and phophatases. The 
heterologous gene may encode therapeutically significant proteins or peptides, such 
as growth factors, cytokines, ligands, receptors and inhibitors, as well as vaccines 

30 and antibodies. The gene may encode commercially important industrial proteins or 
peptides, such as proteases, carbohydrases such as amylases and glucoamylases, 
cellulases, oxidases and lipases. The gene of interest may be a naturally occurring 
gene, a mutated gene or a synthetic gene. The term "homologous protein" refers to 
a protein or polypeptide that naturally occurs in the host cell. The present invention 
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encompasses homologous proteins that are introduced into the host cell via 
recombinant means. The term "toxic" as used herein refers to any protein that 
inhibits the growth of a bacterial cell. A protein may be toxic to a host cell due to an 
intrinsic harmful nature or due to expression levels in the host bacterial cell. An 

5 illustrative example of a toxic protein disclosed herein is the mouse apoptosis 
modulator protein, Bax. Examples of proteins considered to be toxic due to their 
intrinsic nature include nucleoses, proteoses and phospholiposes. 

As used herein, the term "overexpressing" when referring to the production of 
a protein in a host cell means that the protein is produced in greater amounts than it 

10 is produced in its naturally occurring environment. 

Detailed Description of the Preferred Embodiments 

The present invention provides a means for the production of proteins or 
polypeptides in a host cell, especially proteins or polypeptides that are toxic to the 

15 cell due to the protein's intrinsic nature or due to expression levels of the protein 
produced recombinantly in the cell. 

The present invention provides methods for producing a protein in a host cell 
wherein the cell comprises nucleic acid encoding a fusion protein comprising a 
chaperonin binding domain and the protein and wherein the cell naturally produces a 

20 chaperonin that binds to the chaperonin binding domain. In this embodiment, the 
host cell is grown under conditions suitable for inducing or enhancing the levels of 
the naturally occurring chaperonin. The present invention encompasses methods for 
producing a protein in a host cell wherein the cell comprises nucleic acid encoding a 
fusion protein comprising a chaperonin binding domain and protein and said host cell 

25 further comprises nucleic acid encoding a chaperonin that has been recombinantly 
introduced into said host cell. In this embodiment, the chaperonin may be 
homologous or heterologous to said host cell and is associated with expression 
signals capable of overexpressing the chaperonin. 

In an illustrative example disclosed herein, the mammalian gene bax, a 

30 member of the bcl-2 family of apoptosis modulators (Oltvai, Z.N., Milliman, C.L., and 
Korsmeyer, S.J. (1993) Bcl-2 Heterodimerizes in vivo with a conserved Homolog, 
Bax, that accelerates Programmed cell death, Cell 74, 609-619) was used. Although 
homologous to bcl-2 and bcl-X L , which inhibit apoptosis, bax has the opposite 
function and is an effector of cell death (McDonnell, T.J., et a!., 1996, Importance of 
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the Bcl-2 family in cell death regulation, Experientia 52, 1008-1017). These three 
genes were expressed in E. coli as fusions with the OmpA leader sequence. Bcl-2 
and Bcl-X L proteins were produced in the periplasm of E. coli, but only trace amounts 
of Bax was produced by this approach. Expression of Bax appeared to be highly 

5 toxic to the host cell. No expression of the native form of Bax was observed from 
any clones when we placed the bax cDNA sequence adjacent to the lac promoter. 
When the bax gene was linked to a chaperonin binding domain obtainable from 
GroES and produced as a fusion protein in E.coli simultaneous with overproduction 
of the GroEL chaperonin, overexpression of the Bax protein was observed, 

10 suggesting that the toxic effect of Bax on the host cell had been reduced. 

I. Chaperonin nucleic acid and amino acid sequences 

The present invention encompasses chaperonin proteins that are associated 

with increased protein secretion and those that are associated with the folding and 
15 unfolding of polypeptides, including but not limited to, the heat shock 60 family of 

proteins (Hsp60). 

In one embodiment, the chaperonin is obtained from an organism listed in 
Table I and has the respective chaperonin binding domain as given in Table I. In a 
preferred embodiment herein, the chaperonin is GroEL the nucleic acid and amino 

20 acid sequence of which is disclosed in Hemmingsen, et al., 1988, Nature vol. 333, 
pages 330-334 . A method for isolation of GroEL is described, for example, in the 
reference Hendrix, R.W. 1979, J. Mol. Biol. 129:375-392. 

The chaperonin may be naturally occurring in the host cell in which case the 
host cell comprising the chaperonin is subjected to conditions that result in an 

25 increase in the production of the chaperonin. This provides elevated levels of 

chaperonin to which the chaperonin binding domain attaches. Methods for inducing 
the natural levels of chaperonin in a host cell include heat shock (Welch W.J., 1993, 
Philos Trans R. soc Lond B Biol Sci vol. 339, pages 327-333); chemical shock, such 
as by the addition of ethanol, methanol, glucose, and drugs such as those described 

30 in Volker et al., 1994, Microbiology, vol 140: pages 741-752; Barbosa et al., Gene, 
1994, vol. 148, pages 51-57; and Hartke, 1997, Curr Microbiol, vol 34, pages 23-26. 

The chaperonin may be heterologous to the cell and introduced into the cell 
via recombinant means. The heterologous chaperonin, or portions or fragments 
thereof capable of binding to the chaperonin binding domain, may be introduced via 
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a replicating plasmid or integrated into the host genome by means known to those of 
skill in the art. The present invention also encompasses host cells having additional 
copies of homologous chaperonins, or portions or fragments thereof capable of 
binding to a chaperonin binding domain, introduced into the cell. 

5 In one illustrative example disclosed herein, the host cell used was E.coli 

which naturally produced the GroEL chaperonin as well as the GroES co-chaperonin, 
and further comprised nucleic acid encoding a fusion of the chaperonin binding 
domain having the amino acid sequence EVETKSAGGIVLTGSAAA with the mouse 
apoptosis modulator protein, Bax. The recombinant E.coli was subjected to growth 

10 conditions that stimulated overproduction of the naturally occurring GroEL 
chaperonin and expression of the fusion protein was observed. 

II. Chaperonin binding domain 

The present invention encompasses chaperonin binding domains that are 
15 capable of binding to a chaperonin with an affinity of between 1CT 2 and10~* Kd. 

Examples of chaperonin binding domains are provided in Table I. Table I provides 
the sequence of chaperonin binding domains and a list of the respective 
microorganism from which the binding domain is obtained (Hunt et al., 1996, Nature 
vol. 379, pages 37-45). 

20 

Table I 



Organism 


Chaperonin binding domain 


chlO ecoli 


EVETKSAGGIVLTGSAAAK 


chlO acype 


EVESKSAGGIVLTGSAAGK 


chlO haedu 


EVETCSAGGIVLTGSATVK 


chlO pseae 


EEETKTAGGIVLPGSAAEK 


chlO chrvi 


EEERLSAGGIVIPDSATEK 


eg 10 coxbu 


EEERTSAGGIVIPDSAAEK 


chlO legmi 


EEERTTAGGIVIPDSATEK 


chl3 rhime 


ESEEKTKGGIIIPDTAKEK 


chlO legpn 


EEERTTAGGIVIPDSATEK 


chlO bruab 


ESEAKTAGGIIIPDTAKEK 


chl2 braja 


DAEEKTAGGIIIPDTVKEK 


chlO agrtu 


ESEAKTKGGIIIPDTAKEK 


chlO cloab 


EAEETTKSGIVLPSSAKEK 


chlO amops 


EEERTTAGWIVIPDSATEK 


chll rhime 


ESEEKTKGGIIIPDTAKEK 


chlO lacla 


EEEEKSMGGIVLTSASQEK 


chlO stral 


DAEQTTASGLVIPDTAKEK 
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en l \j incp j 


FTFFKTASGIVLPDTAKEK 


chlO bacsu 


FSFFKTASGTVI PDSAKFK 


chlO bacst 


FTFFKTASGIVI PDTAKFK 


1 f\ rvi \ir*t\ i 

en i u myciu 


EAETTTASGLVIPDTAKJEK 


i j uidjd. 


DAEEKTAGGIIIPDTAKEK 


/>h1 fi ctQOIl 


EQEQTTKSGIVLTDSAKEK 


1 fi m \ ir* r\ 


EAETTTASGLVIPDTAKEK ! 

1_ f-^Vl—^ 111. iikjVJi^ ▼ 11 1— ^ 1 iilVUrlV 


r* Vi 1 0 m ur 1 p» 
ciiiw iiiyeic 


EAETMTPSGLVIPENAKEK 


CI 1 1 VJ C HJJJC 


EAEETTKSGIIVTGTAKER 


^Vi 1 0 c \/n r^7 
L-ii i vj oynp / 


EAEEKTAGGIILPDNAKEK 


r\\ 1 A c\rr\wf\ 
\,\\Y\J bynpo 


EAEEKTAGGIILPDNAKEK 


cniu synyj 


PAFFKTAGGII I PDNAKFK 


chlO chlpn 


EEEATARGGIILPDTAKKK 


chlO lepin 


QEAEEKIGSIFVPDTAKEK 


chlO chips 


EEDSTARGGIJ LPDT AKKK 


chlO chltr 


EEASTARGGIILPDTAKKK 


chlO rat 


AAETVTKGGIMLPEKSQGK 


chlO bovin 


AAETVTKGGIMLPEKSQGK 


chlO ricts 


QNDE. AHGKILIPDTAKEK 


chlO spiol 


EVENKTSGGLLLAESSKEK 


chlO arath 


IQPAKTESGILLP . EKSSK 



In a preferred embodiment, the chaperonin binding domain is the sequence 
EVETKSAGG I VLTGSAAA or portions or variations thereof which bind to the GroEL 
chaperonin with an affinity of between about 10~ 2 to about 10" 8 Kd. 

5 For construction of a fusion protein, the chaperonin binding domain may be 

directly linked to the desired protein, peptide or polypeptide, or indirectly linked, ie 
comprising additional nucleic acid between the nucleic acid encoding the chaperonin 
binding domain and the protein or peptide or polypeptide. Such additional nucleic 
acid may encode enzymatic cleavage sites or chemical cleavage sites. Nucleic acid 

10 encoding the chaperonin may be 5' or 3' to the nucleic acid encoding the protein, 
peptide or polypeptide. 

III. Expression systems 

The present invention encompasses expression vectors and host cells 
15 comprising a chaperonin binding domain for the production of proteins, peptides or 
polypeptides in host cells. Nucleic acid encoding a chaperonin binding domain can 
be isolated from a naturally occurring source or chemically synthesized as can 
nucleic acid encoding a desired protein, peptide or polypeptide. Once nucleic acid 
encoding a binding domain of the present invention, or a protein, peptide or 
20 polypeptide, is obtained, fusion proteins comprising the chaperonin binding domain 
and the protein, peptide or polypeptide and recombinant host cells comprising the 
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fusion proteins may be constructed using techniques well known in the art. 
Molecular biology techniques are disclosed in Sambrook et al., Molecular Biology 
Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989). Nucleic acid encoding a chaperonin binding 
5 domain and/or protein is obtained and transformed into a host cell using appropriate 
vectors. A variety of vectors and transformation and expression cassettes suitable 
for the cloning, transformation and expression in host cells are known by those of 
skill in the art. 

Typically, the vector or cassette contains sequences directing transcription 
10 and translation of the nucleic acid, a selectable marker, and sequences allowing 
autonomous replication or chromosomal integration. Suitable vectors comprise a 
region 5' of the gene which harbors transcriptional initiation controls and a region 3' 
of the DNA fragment which controls transcriptional termination. These control 
regions may be derived from genes homologous or heterologous to the host as long 
15 as the control region selected is able to function in the host cell. 

Initiation control regions or promoters, which are useful to drive expression of 
the chaperonin binding domain, or fusion protein comprising the chaperonin binding 
domain, in a host cell are known to those skilled in the art. Virtually any promoter 
capable of driving expression is suitable for the present invention. Nucleic acid 
20 encoding the chaperonin binding domain is linked operably through initiation codons 
to selected expression control regions for effective expression of the chaperonin 
binding domain. 

Once suitable cassettes are constructed they are used to transform the host 
cell. General transformation procedures are taught in Current Protocols In Molecular 
25 Biology (vol. 1, edited by Ausubel etal., John Wiley & Sons, Inc. 1987, Chapter 9) 
and include calcium phosphate methods, transformation using PEG and 
electroporation. 

A host cell which contains the coding sequence for a chaperonin or 
chaperonin binding domain of the present invention and expresses the protein may 
30 be identified by a variety of procedures known to those of skill in the art. These 

procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridization and 
protein bioassay or immunoassay techniques which include membrane-based, 
solution-based, or chip-based technologies for the detection and/or quantification of 
the nucleic acid or protein. 
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A host cell comprising a fusion protein comprising a chaperonin binding 
domain is used to express proteins, peptides or polypeptides which are normally 
toxic to the host cell. A toxic protein may affect the growth of the cell due to its 
intrinsic qualities or due to the affects on the cell due to overexpression. 
5 The manner and method of carrying out the present invention may be more 

fully understood by those of skill in the art by reference to the following examples, 
which examples are not intended in any manner to limit the scope of the present 
invention or of the claims directed thereto. 

10 EXAMPLES 

Materials and Methods 

Genes, strains, media and growth conditions. 

E. coli strain JM83 was obtained from Dr. Deborah Hanson, Argonne 
National Laboratory and strain JM105 was from Pharmacia. For cloning, strains were 

15 cultured on LB or 2xYT medium with 100 pg/ml ampicillin where appropriate, and for 
physiological studies, strains were grown on LB medium supplemented with 
appropriate carbon sources and electron acceptors. The plasmid pASK40 was from 
Skerra et al., 1991, Bio/Technology 9:273-278, pJF118EH was from Dr. Michael 
Bagdasarian (Furste et al., 1986, Gene 48, 1 19-31), pTRC99a and pUC19 were from 

20 Pharmacia. Reagents used in polymerase chain reactions (PCR) were from Perkin- 
Elmer; isopropyl-b -D-thiogalactopyranoside (IPTG) was from US Biochemicals; 
enzymes for molecular biology were from Promega, Inc.; all other chemicals were 
purchased from Sigma (St. Louis). 

The pSK-mBax plasmid containing the cDNA encoding murine-Bax was a gift 

25 from Dr. John C. Reed from the La Jolla Cancer Foundation. The plasmid containing 
murine-Bc/-2 cDNA was provided by Dr. Timothy J. McDonnell at the University of 
Texas M. D. Anderson Cancer Center. Nucleic acid sequences for murine bax and 
murine bcl-2 found in GenBank entries L22472 (Oltvai et al., 1993, Cell, 74:609-619) 
and M16506 (Negrini et al., 1987, Cell, 49:455-463) were used to design PCR 

30 primers that would 1 ) introduce an EcoRI site at the 5'-end of the gene to allow 
cloning into the vector pASK40 in the correct reading frame, and 2) encode an 
additional five histidine residues at the 3'-end of the gene to facilitate purification of 
expressed proteins (Hochuli et al., 1988, Bio/Technology 6:1321-1325). MDH was 
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previously cloned from genomic E. coli DNA by PCR into pASK40 and pTRC99a 
(Boernke, et al (1995) Arch. Biochem. Biophys. 322:43-52). 



Table 2. Plasmids used. 



Plamsid 


Components 


Source and/or reference 


_ A O 1/ A f\ 

pAbK40 


old, laciq, K/vJ/ac, i ipp, ti-ikd, omp/\ 


^oKerra ei di., buprdj 


rt ir-t<( r>f— i i 

pJFI lobn 


D/a, isciq, rldC 


^rursie ei ai., bupraj 


pAT PUU4 


oTOto-ioop encoaing leaaer in pjri lotn 


inib appiiudiiun 


pA I P UUo 


Muiticioning sue or puu iy suDsiiiuiea Tor 


into dppiiudiiun 




that OT Jrllobn in pAI KUU4 




pMDrll 


t. co// rncrn in pAoi\4U 


^DoernKt? ei ai., bupra^ 


pMDH13 


E. colimdh in pTRC99a 
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Construction of vectors for expressing chaperonin binding domain fusion 
proteins 



Vectors designed to comprise the chaperonin binding domain obtainable from 
GroES were constructed as follows. Oligonucleotides were synthesized based on the 

10 published amino acid sequence of the E. coli GroES protein. Residues 16 through 
33 comprise the chaperonin binding domain EVETKSAGGIVLTGSAAA. A 
nucleotide sequence encoding this sequence and its complement were generated by 
the program Lasergene (DNAStar, Inc., Madison, Wl.) using the general codon 
preferences for E. coli. A linker was designed to require the same reading frame as 

15 that required in pASK40. Additional nucleotides encoding the overhang generated 
by an EcoRI digest of DNA and an ATG initiation codon were included at the 5'-end 
of each to give: 
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Oligo ATP6: 5'-AATTATGGAAGTTGAAACCAAATGTGCTGGTGGTATCG- 

-TTCTGACCGGTTCTGCTGCTGCG-3' 
Oligo ATP7: 5'-AATTCGCAGCAGCAGAACCGGTCAGAACGATACCACCA- 
-GCAGATTTGGTTTCAACTTCCAT-3' 
5 The design of the linker for attachment of the GroEL-binding domain of GroES to 
proteins is shown in Figure 5. 

The linker was phosphorylated using T4 polynucleotide kinase, purified with 
Qiaex resin (Qiagen), and ligated into the dephosphorylated EcoRI site of the vector 
pJF1 18EH. Transductants of strain JM105 were screened for the presence of an 
10 EcoRI site, which is present in the linker but absent in pJF1 18EH. Positive 

transductants were screened for orientation of the insert. Only one end regenerates 
an EcoRI site, and in the correct orientation that site is adjacent to a multicloning 
site. EcoRI-EcoRV digests were analyzed on a 1.5% agarose gel for the presence 
of the 1 ,260 base pair fragment predicted for the correct orientation as opposed to 
15 the 1 ,200 base pair fragment predicted for the reverse orientation. The new vector 
was designated pATP004. 

Because the multi-cloning site of pATP004 was limited in the number of sites 
available for cloning, we exchanged this cluster for the larger cluster of pUC19 using 
the enzymes EcoRI and Hindlll. Plasmids obtained from JM105 transductants were 
20 screened for orientation, as above, and for the presence of the Kpnl and Xbal sites 
present only in the pUC19 multicloning site. The resulting plasmid was designated 
pATP005. 

Expression and analysis of proteins 

25 Cultures containing plasmids were grown in LB medium to A600 0.2-0.5 then 

induced with 0.1 mM IPTG. To prepare cytoplasmic fractions of total proteins, cells 
were lysed by treatment with lysozyme. Expression of noncatalytic proteins was 
estimated by conventional gel electrophoresis. Gels were scanned with a UMax 
Powerlookll scanner and the images were analyzed with Digital Science 1D software 

30 (Eastman Kodak). For more rigorous quantitation, 2-dimensional gel electrophoresis 
was performed. 

Two-dimensional gel electrophoresis 
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Cell pellets (7.85 OD600 units) were lysed in 1 mL of a solution containing 
9 M urea, 2% 2-mercaptoethanol, 4% Nonidet P40, and 2% ampholytes (BioRad pH 
8-10). The resulting homogenates were centrifuged for 5 min at 435 x g in a 
Beckman TL100 tabletop ultracentrifuge to remove particulates. The supernatants 

5 were then frozen at -70° C until electrophoresis. Isoelectric focusing in the first 
dimension was done essentially as described by Anderson and Anderson (1978, 
Anal. Biochem, vol. 85, pages 33-340) using 50% pH 3-10 Biolyte, 25% pH 5-7 
Biolyte, and 25% pH 5-7 Servalyte carrier ampholytes. After equilibration in sodium 
dodecyl sulphate buffer (O'Farrell, 1975, J. Bio. Chem., vol. 250, pages 4007-4021) 

10 the focused proteins were separated in the second dimension in slab gels containing 
a linear gradient of 10-17% polyacrylamide, essentially as described by O'Farrell, 
supra, with modifications described by Anderson and Anderson, 1978, Anal. 
Biochem, vol. 85: pages 341-354. After electrophoresis, gels were fixed and stained 
in 0.125% (w/v) Coomassie Blue R250 in 2.5% phosphoric acid and 50% ethanol for 

15 approximately 24 h. The gels were then destained in 20% ethanol. Protein patterns 
were digitized using an Eikonix 1412 CCD scanner interfaced with a VAXstation 
4000-90. Image processing and generation of parameter lists, referred to as spot 
files, were as previously described Anderson, 1982. A master pattern was created 
using a copy of a 2DE spot file from the "ethanol and induced" treatment group so 

20 that the Bax protein was represented. Each of the patterns generated for the 

experiment, with four replicate patterns within each sample group, was matched to 
the master and then examined interactively for the appearance of new proteins, the 
loss of normally expressed proteins, and for statistically significant quantitative 
differences between the control patterns and the "induced" or "ethanol and induced 

25 patterns". 
Example I 

Construction of Expression Vector 

A synthetic linker designed to encode the chaperonin binding loop of GroES 
(EVETKSAGGIVLTGSAAA) was ligated into the EcoRI site of plasmid pJF118EH. 
30 Plasmid DNA was prepared from representative colonies arising from transformation 
of E. coli JM105 with this ligation mixture, and screened for the presence of an Agel 
site, which is unique to the introduced linker. Of 10 colonies screened, all contained 
the site. The plasmids were then screened for orientation of the linker, since it could 
be incorporated in two directions. The linker was designed so that only one EcoRI 
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site would be regenerated, which in the desired orientation would be attached to the 
multi-cloning site. Plasmids were digested with EcoRI and EcoRV (present in the tac 
promoter) and analyzed on a 1 .5% gel. The desired orientation, present in 5 of the 
plasmids, generated a 1,260 base-pair fragment; those in the wrong orientation, with 

5 the reconstituted EcoRI site at the downstream side of the inserted linker, generated 
a fragment of 1 ,200 basepairs. A representative of the correct orientation was 
propagated and designated pATP004. 

To expand the potential of the vector, the small multi-cloning site of 
pAF1 18EH was excised from pATP004 by digesting it with EcoRI and Hindlll, and 

10 replaced with the EcoRI-Hindlll multi-cloning site of pUC19. Of eight colonies 

screened, seven contained the inserted multi-cloning site. These were further shown 
to have the expected orientation and the additional restriction sites. A representative 
colony was designated pATPOOS. 
Example II 

15 Insertion of genes into the expression vectors. 

The vectors described above initiate protein synthesis at the ATG codon in 
the new linker sequence that encodes EVETKSAGGIVLTGSAAA. The vector's 
multi-cloning site follows this linker, and genes are introduced into the cloning sites 
such that their reading frame matches that of the linker. 

20 The gene encoding mouse Bax was cloned from the pSK-mBax vector and 

the gene encoding murine Bcl-2 was cloned from cDNA by PCR as described in 
Materials and Methods. The bax gene was subsequently moved into the EcoRI- 
BamH1 sites of pATP004 to give the vector pATP01 1 . Genes encoding the BAX 
homolog Bcl-2 and the E. coli MDH were also recloned into pATP004 by standard 

25 methods. 
Example III 

Expression of the fusion protein comprising BAX-chaperonin binding 
domain 

Induction of BAX was compared for three classes of genetic constructs 
30 designed to express the BAX gene: 1) without a leader sequence, 2) with the OmpA 
leader, and 3) with the chaperonin binding domain obtainable from GroES. 

The possible effect of the GroES-loop leader sequence on expression of BAX 
protein was evaluated in cultures that enhance the expression of E. coli chaperones 
in the cell. The rationale was based on the assumption that folding of expressed 
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fusion proteins would be mediated through interaction with the chaperonin GroEL. 
When grown in the presence of moderate concentrations of ethanol, E. coli is known 
to induce higher levels of chaperones and other stress proteins, Barbarosa, supra. 
We first evaluated the effect of various concentrations of ethanol on the growth of 

5 the host strain, JM109. A concentration of 2% ethanol (0.44 M) reduced the growth 
rate approximately 2-fold and was chosen for induction studies (Fig. 2). 

Only those transformants containing BAX as a fusion with the chaperonin 
binding domain obtainable from GroES, showed production of the BAX protein. 
When those transformants without a leader and not producing BAX were induced, 

10 their growth was unaffected. Extracts prepared from these cultures showed patterns 
of proteins that were identical for the induced and uninduced cultures; there was no 
evidence of expression of a heterologous protein, either by the appearance of a new 
band of the expected molecular weight or by alterations in the abundance of E. coli 
host proteins that typically occur on overexpression. 

15 Apparently none of the transformants contained a functional BAX expression 

system (DNA sequencing analysis of the 5'-end of the genes indicated that the 
constructs were as expected in that region). In striking contrast, induction of 
transformants constructed to produce BAX with either of the leader sequences 
(OmpA or GroES-loop) caused an immediate cessation of growth. 

20 Uninduced cells grew normally. In contrast IPTG-induced cells growth 

dropped immediately following introduction with IPTG (Fig. 3). This apparent strong 
toxicity of BAX was moderated when cultures were grown in the presence of ethanol, 
but only in the case of the BAX chimera containing chaperonin binding domain 
obtainable from GroES (solid circles in Fig. 3). Analyses of the proteins present at 

25 the end of this period of induction (Fig. 4) revealed the presence of a new protein of 
the anticipated molecular weight (indicated by the "bar" in the low molecular weight 
region (the BAX molecular weight is 21 ,419 Daltons) of the gel between lanes 3 and 
4 and between lanes 7 and 8). The amount of this new protein was quite low in each 
case except that of having the chaperonin binding domain obtainable from GroES 

30 produced in cells grown in the presence of ethanol (Fig. 4). Densitometry of this 
region of the gel indicated that the presence of ethanol resulted in at least a 15 fold 
increase in the amount of the BAX produced. Under these conditions inclusion of 
ethanol induced GroEL production approximately 1.5-1.8 fold. 
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Due to the presence of other proteins of similar molecular weight, the 
estimates of relative production of the BAX and GroEL proteins from 1-D gels are 
inaccurate. Therefore the extracts were separated by 2-dimensional gel 
electrophoresis (DE). Samples were separated by 2-DE and the resulting protein 

5 patterns were analyzed for qualitative and quantitative differences as described in 
Materials and Methods. The extent of the overexpression of BAX and its 
enhancement caused by inclusion of ethanol in the culture was found to be greater 
than that suggested by the 1-dimensional gel analysis. In the absence of IPTG 
inducer, no protein attributable to BAX was detectable. 

10 The protein pattern analysis indicated that of all the proteins separated on 

2DE, only four were overexpressed: GroEL, Hsp70, the fusion of chaperonin binding 
domain and BAX, and serine hydroxymethyl transferase (Table 3). 

Table 3 . Qualitative analysis of protein expression. 





condition 


avg. density 


relative amt 


DnaK 


control 


nos. 


1 




+IPTG (A) 


nos. 


0.7 




+IPTG, +EtOH (B) 


nos. 


2.1 


GroEL 


control 


nos. 


1 




condition 


avg. density 


relative amt 




+IPTG (A) 


nos. 


0.85 




+IPTG, +EtOH (B) 


nos. 


2.8 


SHMT 


control 


nos. 


1 




+IPTG (A) 


nos. 


0.6 




+IPTG, +EtOH (B) 


nos. 


1.6 


fusion with BAX 


control 


0 


0 




+IPTG (A) 


3089 






+IPTG, +EtOH (B) 


45494 


14.7 rel to -EtOH 



15 

No density above background was detected for the fusion with BAX in the 
absence of IPTG inducer. The small amount formed when IPTG was present was 
increased 14.7 fold when cultures were grown with 2% ethanol. Under these 
conditions, GroEL and Hsp70 were enhanced 3 fold above the amount observed 
20 when only IPTG was present. 
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Various other examples and modifications of the foregoing description and 
examples will be apparent to a person skilled in the art after reading the disclosure 
without departing form the spirit and scope of the invention, and it is intended that all 
such examples or modifications be included within the scope of the appended 
5 claims. All publications and patents referenced herein are hereby incorporated in 
their entirety. 
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